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Abstract 

We describe the statistics of repetition times of a string of symbols in 
a stochastic process. 

Denote by ta the time elapsed until the process spells the finite string 
A and by Sa the number of consecutive repetitions of A. We prove that, 
if the length of the string grows unbondedly, (1) the distribution of ta, 
when the process starts with A, is well approximated by a certain mixture 
of the point measure at the origin and an exponential law, and (2) Sa is 
approximately geometrically distributed. We provide sharp error terms 
for each of these approximations. The errors we obtain are point-wise and 
allow to get also approximations for all the moments of ta and Sa- To 
obtain (1) we assume that the process is (/>-mixing while to obtain (2) we 
assume the convergence of certain contidional probabilities. 

Keywords: Mixing, recurrence, rare event, return time, sojourn time. 
Running head: Return times under mixing conditions. 



1 INTRODUCTION 

This paper describes the return time statistics of a string of symbols in a 
mixing stochastic process with a finite alphabet. Generally speaking, the study 
of the time elapsed until the first occurrence of a small probability event has a 
long history. See for instance [10] and the references therein. The typical result 
is: 

lim IP iTA„ > t b,, \ ^io) = e-* . (1.1) 

n — ^oo 

Here ta„ is the first time the process hits a given measurable set An, n e IN. We 
assume that the measure IP (An) goes to zero as n — > oo. {bnjnejv is a suitable 
re-scaling sequence of positive numbers and /io is a given initial condition. 

Recently an exhaustive analysis of these statistics was motivated by ap- 
plications in different areas as entropy estimation, genome analysis, computer 
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science, linguistic, among others. From the point of view of apphcations, a fun- 
damental task is to understand the rate of convergence of the limit A 
detailed review of such results appearing in the literature can be found in ^ . 

It is the purpose of this paper to present the following new results: For any 
string A of lenght n 

- A sharp upper bound for the above rate of convergence that holds when /iq — A. 
In this case we assume that the process is (/)-mixing . 

- A sharp upper bound for the difference between the law of the number of 
consecutive visits to A and a geometric law. In this case we assume a kind of 
continuity for certain conditional probabilities, see (jS.ip . on the 

When /io is taken as A, we refer to the distribution IP{ta > i | ^) as the 
return time. In general it can not be well approximated by an exponential law. 
This was firstly noted by Hirata, when he proved the convergence of the number 
of visits to a small cylinder around a point to the Poisson law. His result holds 
for axiom A diffeomorphisms (see [H]). The result holds for almost every point. 
Then, he proved that for periodic points, the asymptotic limit law of the return 
time differs from the one- level Poisson law, namely e^*. 

Our first result concerns the rate of convergence of limit in (|l.ip when iiq = A 
for any string A of lenght n.. We prove that the return time law converges to 
a convex combination of a Dirac law at the origin and an exponential law. 
Specifically, we show that for large n 



t{A) is the position of the first overlap of A with a copy of itself (see definition 
below). is a parameter related to the overlap properties of the string A. 
It is worth noting that the parameter of the exponential law is exactly the 
weight of the convex combination. So far, the overlap properties of a string 
appears as a major factor to describe the statistical properties of the return 
time. For instance, if a string overlaps itself, then it will turn out in the sequel 
that Ca 7^ 1 a-iid the return time distribution approximates the above mixture 
of laws. However, for a word which does not overlap itself, it will turn out that 
Ca = ^ and the return time distribution approximates a purely exponential law. 
For the role of overlaps an a treatment of the independent case with a good 
introduction to the previous literature see [5], and for the Markov case with a 
probability generating functions point of view see [16| . 

It is worth recalling at this point that when in equation (jl.ip the initial 
condition is the equilibrium measure of the process, ta is called the hitting time 
of A. In [12] it is proved a rate of convergence of the return time as function of 
the distance between the hitting time and return time laws. While this result 
applies only for cylinders around non-periodic points, our result applies to all 
of them. 

The great enhancement of our work is that, contrarily to all the previous 
works which present bounds depending only on the string A, our error estimate 
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decays exponentially fast in t for alH > 0. As a byproduct we obtain explicit 
expressions for all the moments of the return time. This also appears as a 
generalization of the famous Kac's lemma (see [T3]) which states that the first 
moment of the return time to a string A of positive measure is equal to ]P{A)~'^ 
and the result in [J which presents conditions for the existence of the moments 
of return times. Further, [T2] proves that hitting and return times coincide if 
and only if the return time converges to the exponential law. We extend this 
result establishing that the laws of hitting and return times coincide if and only 
if the weight of the Dirac measure in the convex combination of the return time 
law is zero, which is equivalent to consider a non-overlapping string. 

Our framework is the class of 0-mixing processes. For instance, irreducible 
and aperiodic finite state Markov chains are known to be ^-mixing (and then </)- 
mixing ) with exponential decay. Moreover, Gibbs states which have summable 
variations are "^-mixing (see [17j). They have exponential decay if they have 
Holder continuous potential (see [6]). However, sometimes the -(/'-mixing condi- 
tion is very restricted hypothesis difficult to test. We establish our result under 
the more general (/)-mixing condition. Further examples of ^-mixing processes 
can be found in [Mj . The error term is explicitly expressed as a function of the 
mixing rate 0. We refer the reader to [§] for a source of examples and definitions 
of the several kinds of mixing processes. 

The base of our proof is a sharp upper bound on the rate of convergence of 
the hitting time to an exponential law proved in [5]. 

The self-repeating phenomena in the distribution of the return time leads us 
to consider the problem of the sojourn time. Our second result states that the 
law of the number of consecutive repetitions of the string A, denoted by Sa, 
converges to a geometric law. Namely 

]P{SA^k\A)^il~p{A))piA)'^. (1.2) 

Again here, the parameter p{A) depends on the overlap properties of the string. 
Furthermore we show that under suitable conditions one has p{A) « 1 — ^a. As 
far as we know, this is the first result on this subject for dependent processes. 

As in our previous result, the error bound we obtain decreases geometrically 
fast in k (see (11.21) '). This decay on the error bound allows us to obtain an 
approximation for all the moments of Sa for those of a geometrically distributed 
random variable. 

Our results are applied in a forthcoming paper: In the authors prove large 
deviations and fluctuations properties of the repetition time function introduced 
by Wyner and Ziv in ^18j and further by Ornstein and Weiss in and get 
entropy estimators. 

This paper is organized as follows. In section 2 we establish our framework. 
In section 3 we describe the self-repeating properties needed to state the return 
time result. In section 4 we establish the approximation for the return time 
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law. This is Theorem 14. II Finally, in section 5 we state and prove the geometric 
approximation for the consecutive repetitions of a string. This is Theorem 15.11 

2 FRAMEWORK AND NOTATION 

Let C be a finite set. Put fl = . For each x — {xm)m£Z G ^ and m G Z, 
let Xfn : fl ^ C he the m-th coordinate projection, that is Xm{x) — Xm- We 
denote by T : ^ £7 the one-step- left shift operator, namely {T(x))m — Xm+i- 

We denote by J- the tr-algebra over generated by strings. Moreover we 
denote by Ti the cr-algebra generated by strings with coordinates in /, / C Z. 

For a subset A C f2, A e C" if and only if 

A = {Xq = flo; . . . ; Xn-i — fln-i} , 

with ai Cz i — 0, . . . ,n — 1. 

We consider an invariant probability measure IP over We shall assume 
without loss of generality that there is no singleton of probability 0. 

For two measurable sets V and W, we denote as usual IP {V\W) = IPw {V) = 
IP {V; W) /1P{W) the conditional measure of V given W. We write IP {V; W) = 

plvnw). 

We say that the process {Xm}^^^ (/i-mixing if the sequence 
cl>{l)^snp\lPB{C)~lP{C)\ , 
converges to zero. The supremum is taken over B and C such that B g 

^{Q,.,n},'"' e 1N,]P{B) > 0, C e ^{meWI m>n-|-i-|-l}- 

We use the measure theoretic notation: {X™ = a;™} = {Xn = a;„, . . . , Xm = 
Xm}- For an n-string A = {Xq^^ = Xq^^} and 1 < u> < n, we write A^"^^ = 
{X^^zIj = x^zIj} for the w-string belonging to the cr-algebra y-{n-w,...,n-i} and 
consisting of the last w symbols of A. We write = i^\V, for the complement 
oiV. 

The conditional mean of a r.v. X with respect to any measurable set V 
will be denoted by IEv{X) and we put 1E{X) when V — ft. Wherever it is 
not ambiguous we will write C for different positive constants even in the same 
sequence of equalities/inequalities. For brevity we put (a V 6) = max{a,6} and 
(a A 6) = min{a, b}. 

3 PERIODS 

Definition 3.1 Let A G C„. We define the period of A (with respect to T) as 
the number t{A) defined as follows: 

t{A) = min {fc e{l,...,n}\An T-'=(A) ^ 0} . 
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By definition, if A G C„, then A = (oo, . . . , a„_i), £ C for < i < n — 1. 
For instance, pick up A = (aaaabbaaaabbaaa) G C^^. Then shift a copy of A 
until there is a fit between them. Namely 

A = aaaabb aaaabbaaa 
T~^{A) = aaaabbaaa abbaaa 

Notice that there is no fit between A and T~^{A) if j = 1,...,5. So that 
TiA) = 6. 

Let us take A G C„, and write n = qT{A) + r, with q = [n/T{A)] and 
< r < t{A). Thus 

A _ ( yt(A)-1 _ -y2r{A)-l _ _ y«r(A)-l _ r{A}-l . yn-1 _ „r-l\ 
^-\^0 — ^t{A) - • • • - ^(g-l)T(A) - "O ' ^gr(A) — "O / ' 

So, we say that A has period t{A) and rest r. Wc remark that periods can be 
"read backward" (and for the purpose of section 5 it will be more useful to do 
it in this way), that is 

A — ( V''-l _ y"-(9-l)'r(^)-l _ _ T^n--r(A)-l _ yra-1 _ _n-l \ 

^— — "'n-r:^n-qT{A) ~ " ~ ^n-2r(A) " ^Ti-r(A) " "n-r(A) / 

{q-l)T{A) 

= Pi 7^J-r(-4)(-^(i-(>l))') PI y9T(A)j-^(r)-j _ 

We recall the definition of A^"'), 1 < w < n, from the end of section 2. For 
instance, using the previously chosen A, 

period period j.gg^ rest period period 

A = (aaaabb aaaabb aiaa) = (^liaa^ abbaaa abbaaa ) . (3.1) 

-2-12^(3) ^6^(6) ^(6) 

In the middle of the above equality, periods are read forward while in the right 
hand side periods are read backward. 

Consider the set of overlapping positions of A: 
{ke{l,...,n-l}\An T-'^{A) ^ 0} = {r(A), . . . , [n/r(A)]r(^)} U UiA) , 
where 

n{A) = {ke {[n/T{A)]T{A) + 1, . . . , n - 1} I A n T-'=(A) ^ 0} . 

The set {t{A), . . . , [n/T{A)]T{A)} is called the set of principal periods of A while 
Tl{A) is called the set of secondary periods of A. Furthermore, put ta = #TZ{A). 
Observe that one has < < n/2. 

The notion of period is related to the notion of retun times. 
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Definition 3.2 Given A E C„, we define the hitting time ta '■ ^ ^ IN U {00} 

as the following random variable: For any x £ £7 

ta{x) = inf{fc >1:T''{x) eA} . 
The return time is the hitting time restricted to the set A, namely ta\a- 

We remark the difference between ta and t{A): while ta{x) is the first time 
A appears in .x, t{A) is the first overlapping position of A. 

Return times before t{A) are not possible, thus, IP a {ta < t{A)) — 0. Still, 
if A does not return at time t{A), then it can not return at times kT{A), with 
2 < k < [n/T{A)], so one has 

Pa {t{A) <ta< [n/T{A)]T{A)) = 0. 

The first possible return time after t{A) is 

_ J min7^(A) n{A) ^ 

~ \ n UiA) = • 

Furthermore, by definition of 'TZ{A) one has Af]T~^ {A) = for all j such that 
[n/T{A)]T{A) < j < n - 1 and i ^ n{A). Thus 

Pa i{[n/T{A)]T{A) + 1 < ta < n - 1} n {ta ^ = 0. 

We finally remark that 

T-'A n T-^A = Vi, J e n{A) . 

Otherwise it would contradict the fact that the first return time to A is t{A) 
since for i,j S Ti-iA) one has \i — j\ < t{A). We conclude that 

Pa {T-'A n T-^A \i,je 7^(A)) = 0. (3.2) 

4 RETURN TIMES 

For AeCn define 

Ca ^= Pa{ta ¥^ t{A)) = Pa{ta > t{A)) . 

The equality follows by the comment at the end of the previous section. 

It would be useful for the reader to note now that according to the comments 
of the previous section, one has 

ta\a e {T{A)}yjn{A)\j{k&]N \k>n} . (4.1) 

We now introduce the error terms that appear in the statement of our main 
result of this section. 
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Definition 4.1 Let us define 

e(v4) =^ inf \{2n + T{A))]P{A^'"^) + (j){nA''w)] . (4.2) 

Theorem 4.1 Let {Xm}jnez 4>-mixing process. Then, for all A G C",n £ 
]N the following inequality holds for all t: 

IP A [ta >t)- ^t<riA)} - l{t>r(A)}CAe-'^---^(^)(*-^(-^»| < 5Ae{A)f{A,t), 

(4.3) 

where f{A,t) = iP(A)te-(';^"i6e(A))j^(A)t^ 

We postpone an example showing the sharpness of e(^) after Lemma 14.21 

Remark 4.1 ^t"-^) is the part of the string A which does not overlap itself in 
A n T^^^A. Note that ua is the position of the first possible return time after 
t{A). Recall that rA = #7^(A) and ua = n if n{A) = 0. Thus A^"') with 
1 < w < TiA is the part of the string A^"^^ after taking out its first ua — w 
letters (this will be to create a gap of length ua — w to use the mixing property). 

Remark 4.2 When TZ{A) = 0, namely, A does not have secondary periods, the 
error e{A) of Theorem \4.1\ becomes info<io<„ ^nIP{A'^'^^) + 0(n — w)] . 

Remark 4.3 In the error term of the theorem, e{A) provides a bound which 
shows the convergence uniform in t of the return time law to that mixture of laws 
as the length of the string growths. The factor IP[A)t provides an extra bound 
for values oft smaller than \/]P{A). The factor e^^^-^^i^^^^)))-^^'^)* provides 
an extra bound for values oft larger than 1/IP{A). 

Remark 4.4 On one hand IP{A) < Ce~'^" (see 111). On the other hand, by 
construction ua > n/2. Further (fiin) — > as n — > oo. Taking for instance 
w = n/A in we warrant the smallness of e{A) for large enough n. 

Corollary 4.1 Let the process {Xm}„iez 4>-mixing. Let /? > 0. Then, for 
all A G C",n e IN, the (3-moment of the re-scaled time IP{A)ta approaches, as 
n oo, to r(/3 + . Moreover 



IP{Af]EA{r^A)'-^-^J^ 



<^(Aff"'""^^. (4.4, 

Ca 



where e*{A) = {c{A) V {nIP{A))^), C > is a constant and T is the analytic 
gamma function. 

Remark 4.5 In particular, the corollary establishes that all the moments of the 
return time are finite. 

Remark 4.6 In the special case when f3 ~ 1, the above corollary establishes a 
weak version of Kac's Lemma (see 'ISj). 



7 



Remark 4.7 For each f3 fixed and n large enough one has (3 e^'^'-^^'-''"'""'^-'/^^ is 
close to P/Ca- Thus in virtue of inequality the corollary reads not just as 

a difference result but also as a ratio result. 

The next corollary extends Theorem 2.1 in [12] . 

Corollary 4.2 Let the process {Xm\„iez 4>-mixing. There exists a constant 
C > such that, for all A € C",n ^ IN and all t > the following conditions 
are equivalent: 

(a) \IPa {ta > t) - e-^(-4)t| < ^ ^(^) ^(^^^) ^ 

(b) \1Pa {ta >t)-lP{TA>t)\<C e{A) f{A, t) , 

(c) \IP{ta> t) -e-^(-4)t| < c e{A) f{A,t) , 

(d) \U -1\<C e{A) . 

Moreover, if {An\ni^N is a sequence of strings such that ]P{An) —> as n — > oo, 
then the following conditions are equivalent: 

(a ) the return time law of A^ converges to a parameter one exponential law, 
(h) the return time law and the hitting time law of A„ converge to the same law, 

(c ) the hitting time law of A^ converges to a parameter one exponential law, 

(d) The sequence {C,An)neN converges to one. 

4.1 Preparatory results 

Here we collect a number of results that will be useful for the proof of The- 
orem H?T] In what follows and for shorthand notation we put fA — 1/(2J'(^)) 
(factor 2 is rather technical). The next lemma is a useful way to use the </>- 
mixing property. 

Lemma 4.1 Let {Xm}m^z ip-mixing process. Suppose that A 3 _B g 

•^{o,...,f)}i C' £ J^{x^N\ x>b+n] with b,g G IN. The following inequality holds: 

]PA{B;C)<PA{B){P{C) + 4>{n)) . 

Proof Since B <Z A, obviously 1P{A nBCiC) = IP{B n C). By the 0-mixing 
property ]P{B;C) < IP{B) {IP{C) + (j)(n)) . Dividing the above inequality by 
IP{A) the lemma follows. □ 

The following lemma says that return times over Ti-{A) have small probabil- 
ity. 

Lemma 4.2 Let {Xm}.^^^ be a (p-mixing process. For all A e C", the fol- 
lowing inequality holds: 

IP A {ta e n{A)) < e{A) . (4.5) 
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Proof For any w such that 1 < w < ua 



Pa (ta e n{A)) < IPaI [j T'' A 

< IPa\ \J T-^A^" 
\jen{A) 



< taIP + 0(nA - w) . (4.6) 



The first inequahty foUows by ([S^]). Since T-^A C T-^ A^""), second one follows. 
Third one follows by the above lemma with B = A and C — Ujg7i(^)T^^A*^"'^. 
This ends the proof since w is arbitrary. □ 

Example 4.1 Consider a process {Xm} „i^z defined on the alphabet C ~ {a, b}. 
Consider the string introduced in liS.l]) : 

A = {(X0...X14) = (aaaabbaaaabbaaa)} . 

Then, n = 15, t{A) = 6, TZ{A) = {13, 14}, = 2 and ua = 13. Thus 
^ {(X2...X14) = (aabbaaaabbaaa)} . 

The (j)-mixing property factorizes the probability 

Pa\ [j T-^a\ =1PaI [j r-^A(i3) I <1PaI [j T-^A^"- 

In such case, a gap at t = lb 0/ length w with < w < IS is the best we can do 
to apply the (p-mixing property. 

The next lemma will be used to get the non- uniform factor f{A, t) in the 
error term of Theorem 14.11 

Lemma 4.3 Let {^mlmg^ be a (f>-mixing process. Let A E and let B G 

^{x£N\ x>kfA}' ^^ih k G IN. Then the following ineguality holds: 

Pa {ta > kfA ; B)<[]P {ta > Ia - 2n) + (b{n)]''-' [IP{B) + (t>{n)] . 

Proof First introduce a gap of length 2n between {ta > kfA} and B. Then 
use Lemma |4. II to get the inequalities 

IPa{ta > kfA ; B) < 1Pa{ta > kfA - 2n ; B) 

< 1Pa{ta> kfA-2n)[IP{B) + cl>{n)] . (4.7) 
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Apply this procedure to {ta > {k ~ 1)/^} and B = {ta o T^^-'^'if^ > Ia- 2n} 
to bound ]Pa{ta > kfA — 2n) by 

IPa{ta >{k- l)fA - 2n) [IP{ta > /a - 2n) + . 

Iterate this procedure to bound IPa{ta > kfA — 2n) by 

IPAiTA >fA- 2n) [1P{ta >fA- 2n) + . 

This ends the proof of the Lemma. □ 

The next proposition estabhshes a relationship between hitting and return 
times with an error uniform in t. In particular, (b) says that they are close (up 
to 2e{A)) if and only if (^a is close to 1. 

Proposition 4.1 Let {^mlmg^ be a (j)-mixing processes. Let A e C„ and k 
a positive integer. Then the following holds: 

(a) For allO<r < fA, 

\IPa{ta > kfA + r)- IPa{ta > kfA)lP{TA > r)\ 
< 2e{A) Pa{ta> kfA-2n) . 

(b) For all i > t{A) e M, 

\IPa{ta >i)- UlP{rA > i)\ < HA) • (4.8) 

Proof To simplify notation, for t ^ Z we write rjj' to mean ta o T*. Assume 
r > 2n We introduce a gap of length 2n after coordinate t to construct the 
following triangule inequality 

\1Pa{ta > kfA + r) - Pa{ta > kfA)IP{TA > r)\ 
< \IPa{ta > kfA + r) - PAiTA > kfA; rjf^^+^n] ^ ^ _ ^n)] (4.9) 
+ \lPA{rA > kfA; T]f^^+2"1 > r - 2n) - IPa{ta > kfA)IP{TA > r - 2n)\ 

(4.10) 

+ PA{TA>kfA)\lP{TA>r~2n)-P{TA>r)\ . (4.11) 

(|4.9[) is bounded by a direct computation by IPa{ta > kfA;T]^'^^^ < 2n). 
This last quantity is bounded using (14. 7p by 

1Pa{ta > kfA - 2n) [2n]P{A) + 0(n)] . 

Term (j4.10p is bounded using the 0-mixing property by 

IPAiTA > kfA)4>{n) . 
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The modulus in (|4.1ip is bounded using stationarity by 

P{ta < 2n) < 2nIP{A) . 

If r < 2n, just change r — 2n by zero and the same proof holds. This ends the 
proof of (a). 

The proof of (b) is very similar to that previous one. We do it briefly. Write 
the following triangle inequality 

\]Pa{ta> i)~CA]P{rA> i)\ 
< \IPa{ta > - IPa{ta > T(A);Ti^(^)+'"l > z - t{A) - 2n)\ 
+ \IPA{rA > r(A); t][(^)+'"1 > z - t{A) - 2n) - UIP{rA > i - t{A) - 2n)\ 
+ CA\IP{TA>i-T{A)-2n)-IP{TA>i)\ . 

The moduli on the right hand side of the above inequality are bounded as follows. 
The first one by IPa{ta > t{A);t]^^^^^ < t{A) +2n-l) which is bounded by 
IPa{ta e n{A) U {n, . . . , t{A) + 2n~ 1}) . The cardinal of n{A) U {n, . . . , + 
2n — 1} is less or equal than n + t{A) + TZ{A). Therefore, the last expression is 
bounded following the proof of LemmalObv {2n + T{A))]P{A^'^'>) + 4>{nA-w) . 

The second one is bounded using the 0-mixing property by . 

The third one is bounded using stationarity by 

]P{ta < t{A) + 2n) < {t{A) + 2n)IP{A). 
This ends the proof of (b). □ 

The following proposition is the key of the proof of Theorem 14.11 

Proposition 4.2 Let {Xm}^^^ be a (f>-mixing process. Let A e C",n G IN 
and let k he any integer k > I. Then the following inequality holds: 

\1Pa{ta > kfA) - IPa{ta > fA)IP{TA > 
< 2eiA){k - \)IPa{ta > /a - 2n)[IP{TA > Ia - 2n) + ^{n)f -^ . 

Proof For k — 1 there is nothing to prove. Take k > 2. The left hand side of 
the above inequality is bounded by 

k 

\1Pa{ta > jJa) ~ 1Pa{ta > U - l)fA)lP{TA > fA)\lP{TA > fAf-'. 

The modulus in the above sum is bounded by 

2€{A)IPa[ta > {j~l)fA-2n) , 

due to Proposition [O] (a)- The right-most factor is bounded using Lemma 
by [F{ta > Ja - 2n) + ^(n)]^^^. The conclusion follows. □ 
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4.2 Proofs of Theorem 14.11 and corollaries 

Proof of Theorem 14.11 We divide the proof according to the different values 
of t: (i) t < t{A), (ii) t{A) <t< Ja and (ii) t > /a- 

Consider first t < t{A). (|4.ip says that the left hand side of (|4.3p is zero. 

Consider now t{A) < t < /a- First write 



(4.12) 



and 



]P{ta > t) 



where 



Further 



W IP{ta > i\TA >t-l) 

i=T{A) + l 

t 

Y[ (1 - 1P{T-'A\ta >i-l)) 

i=T{A) + l 

t 

n {i-p,ip{A)), 

i=T{A) + l 

def 1Pa{ta > i-l) 



IP{ta >i-l) 



(4.13) 



1 - p,IP{A) - e-«^^(^) I < - a I IP{A) + 1 1 - UIP{A) - e-f^-^(-4) 

(4.14) 

Firstly, by Proposition 14.11 (b) and the fact that 1P{ta > i) > 1/2 since i < 
/a = 1/{21P{A)) we have 



(4.15) 



Secondly, note that |1 — a; — e~^\ < for all < a; < 1. Apply it with 

X = CaP{A) to bound the most right term of ([414]) by (CA-ff'(^))^/2. Collecting 
the last two bounds we get 

|1 - p,]P{A) - e-';-^(^) I < ^eiA)]P{A) , V* = r(A) + 1, . . . , . 

Furthermore, since 

\Yla,-Y[b,\<inaic\a,-h\{#t)T[iaii{a,;b,}*'-^ VO < a„ 5, < 1 , (4.16) 
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we conclude from (|4.13p and (|4.12p that 

\]P{ta > t) - e-^-*^('4)(*-^(^»| < ^e{A)lP{A)t , (4.17) 

and 

\]PAirA >t)~ (;Ae-^APiA)it-riA))^ < ^eiA)PiA)t , (4.18) 
for all TA <t < Ja- This concludes this case. 

Consider now t > Ja- Write it as t = kjA + r with k a positive integer and 
< < /a- We do the following triangle inequality 

|iPA(rA>t)-ae-^-^(^)(*--(^»| 

< \IPA{rA>kfA + r)-IPA{rA>kfA)JP{TA>r)\ (4.19) 

+ \IPa{ta > kfA) - iPA(TA > fA)JP{TA > fAf-^\IP{TA > r) (4.20) 

+ \IPa{ta> fA)F{TA> fA)"-' -CAe-'^^''/^\]P{TA>r) (4.21) 

+ ae~'^^'=/2 |iP(rA > r) - | 22) 

By Proposition [57T] (a), the modulus in (|4.19p is bounded by 

2e{A)lPA{TA > kfA) , 

and by Lemma 4.3 

2e{A)PAiTA > kfA - 2n) < 2e{A){IP{TA > fA~ 2n) + (l){n)f-^ . 
The modulus in (|4.20p is bounded using Proposition 14.21 by 
2e{A){k - 1){1P{ta > fA ~ 2n) + ^(n))'^"^ 
Thus, the sum of ((i7Tg)l and is bounded by 

2e{A){IP{TA > fA- 2n) + <j){n)f-^[k + 0(n)] . (4.23) 

On one hand k + (f){n) < fe + 1 < 2k. On the other hand, applying (|4.17p with 

t = fA — 2n we get 

mrA > fA- 2n) - e-a/2+(2„+r(A))i=(A)| < 9^(^) 

Furthermore, by the Mean Value Theorem (MVT) we get 

|g-a/2+(2„+r(A))i^(A) _g-a/2| < (2n + r(yl))7P(A)e(2"+-(^))-P(^) . 

We conclude that for large enough n 

\1P{ta > fA- 2n) + (j){n) - e^'^-^/^ | < 4e(A) 
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And therefore (|4.23p is bounded by 

4e(A)fc(e"^-*/2 +4e(A))*=-2 . (4.24) 
A direct computation using Taylor's expansion gives 

Since t = {k/2IP{A)) + r we get 

which is bounded by 

g~CAJP(A)t+3/2 _ 

Similarly 

g-(CA/2-8e(A))(fe-2) ^ g-(C^-16e(yl))JP(A)t+(CA-16e(A))(J'(A)r+l) ^ 

which for large enough n is bounded by 

g-(a-16e(A))iP(A)t+3/2 _ 

Thus ((i?^ is bounded by 

To bound (|4^ we proceed as follows. From ((4T7|) and (|4?T8)) with t = 
we get that 

|iP(rA>/A)-e-^^/'I 

< \ip{ta > Ia) - e-c-^^(^)(/A-r(A))| ^ g-a/2|gai^(A)r(A) _ ^1 

9 

< -e{A) + nJ'(A) 

< 36(A) , 

and similarly 

\IPa{ta > Ia) - ae-^^/2| < 3e(A) . 

Applying the last two inequalities together with ()4.16l) . we get that the modulus 
in (|4.2ip is bounded by 

3e{A) kmax{]PA{rA > fA);JP{TA > /a); e-^-^/2|fc-i 

< 3e(A) fc(e-'^-^/2+3e(A))'^"^ . 

An argument similar to that used to bound (|4.24p let us conclude that the 
last expression is bounded by 

10e(A)J'(A)te-(^-^-i2.(A))i^(A)t 
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The modulus in (I4.22p is bounded using again (|4.17p when r > t{A) by 
{9/2)e{A). If r < t(A) then it can be rewritten as 

which is bounded by 2n]P{A). We conclude that (|4.22p is bounded by 

{9/2)e{A)e-'^^''/^ = {9/2)e{A)e-^^^^^^'^'-''^ < %t{A)IP{A)te-^^^^'^^' . 
This ends the proof of the theorem. □ 

Proof of Corollary 14.11 Let Y be the r.v. with distribution given by 
1 



P{Y > t) 



P{A) < t < 1P{A)t{A) 
t < 1P{A)t{A) 



(4.25) 



Then we can rewrite (I4.3P as 

\PAiIPiA)TA >t)- IP{]P{A)ta >t)\< Cie{A)J{A,t/IP{A)) 
Integrating (|4.25p we get 

\lEA{{IP{A)TAf)-IE{YP)\ 

= / f3t'^-^{lP{P{A)TA>t)-P{Y >t)) 

Jp{A) 

POO 

< / ptl^-^ \P {1P{A)ta >t)-lP{Y>t)\ 

Jp{A) 

poo 

< Cie{A) / f3tf-'f{A,t/lP{A))dt . 

JP(A) 

Now we compute IE (y^) = SpiA) Pt'^^^^iY > *)• We do it in each interval 
[]P{A),P{A)t{A)] and [IP { A) t{ A), oo). 

The first one is {IP{A)t{A))^ — IP{A)^ . The second one can be re-written 

as 

rP(A)TA'' 



Ca e 



(APiAjTA 



(4.26) 



Consider the exponent of the second factor in (|4.26[) . By definition we have 
<^aS^{A)ta < lP{A)n. Moreover, IP{A) decays exponentially fast on n. Then 
for the second factor we have 

^^CaP{A)ta - i| < CIP{A)n. Further, the first 
integral is r(/3 + 1)/C^- The second one is bounded by {IP[A)taY. We recall 
that the first factor in (|4.26p is C,a- We conclude that 



ie{yP) 



r(/3 + i) 

A/3-1 
^A 



< CnIP{A) + 2{nIP{A)f) < C{nIP{A))^l^''^^ 
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Similar computations give 

CI cr^ 



< 



In the last inequality we used x < 2{1 — e ^)for small enough a: > 0. This ends 
the proof of the corollary. □ 



Proof of Corollary 14.21 (a) <^ (d). It follows directly from Theorem 14. II 
(b) => (a) , (c) . It follows by Theorem 14.11 and Theorem 1 in [2] 
(a) (6) and (c) (b). They follow by Theorem [4T1 Theorem 1 in 2J and 

(|4.15p . The corollary is proved. □ 



5 SOJOURN TIME 

In this section wc consider the number of consecutive visits to a fixed string 
A and prove that the distribution law of this number can be well approximated 
by a geometric law. 

Definition 5.1 Let A G C". We define the sojourn time on the set A as the 
r.v. Sa ■■ ^ ~* ]N U {oo} 

Sa(x) = sup |fc e W I a; e a n T-^^^-^^A ; Vj = 1, . . . , /c} , 

and Sa (x) = if the supremum is taken over the empty set. 

Before to state our main result we have to introduce the following defini- 
tion about certain continuity property of the probability IP conditioned to i 
consecutive occurrences of the string A. 

Definition 5.2 For each fixed A G C", we define the sequence of probabilities 
(Pi(^))iejv as follows: 

p^iA)"^ ip{^ I f|r^"^(^)A 

// the limit lim„^oo Pi{A) exists then we denote it by p{A). 

Remark 5.1 By stationarity pi{A) = 1 — (a- 

In the following 2 examples, the sequence (pi(A))igjv not just converges but 
even is constant. 
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Example 5.1 For a i.i.d. Bernoulli process with parameter < 6 = IP{Xi = 
1) = 1 — IP{Xi = 0), and for the n-string A = {Xq~^ = 1}, we have that 
Pi{A) = l-C,A = for alii e IN. 

Example 5.2 Let {Xmjm^z ^ irreducible and aperiodic finite state Markov 
chain. For A = {Xq~^ = Og"^} € C", the sequence {Pi{A)).^j^f is constant. 
More precisely, by the Markovian property and for all i G IN 

Pi{A) = IP (^XllZli^A) = K-l{A)\^n-T{A)-l = a„-r(A)-l) 

n-l 

n ^(^i=«ii^i-i=«i-i) • 

j=n-T{A) 

The next is an example of a process with infinity memory and converging 

{pi{A))neM- 

Example 5.3 The following is a family of processes of the renewal type. Define 
{Xn)neN o,s the order one Markov chain over IN with transitions probabilities 
given by 

Q{n,n+l)=qn Q(n,0) = l-g„ Vn > 
Define the process 

Jo Xn = 
- \ 1 Xn^O 

The process (X„)„gjv is positive recurrent (and then (i^)neJVy' if and only if 
Y^kLoYij^olj < 00. Direct computations show that 

n k 

p(ro"-i = i) = ^n«^ Vnew. 

fe=0 j=0 

Now chose Qj such that P{Yq~^ = 1) = Q-n+s{n) ^^^^ g^^j g^^y converging 
sequence (to any real number) and such that \5{i + 1) — 5{i)\ < 1 for all i S IN . 
Take A = {Y^'^ = 1}. Thus t{A) = 1. Then 

p^{A) = e-i+^("+»+i)-^("+») and lim pi{A) = e'^ e (0, 1) . 

i—^oo 

In the foUowing theorem we assume that {pi{A))i^j\f converges with velocity 
di{A). Namely, there is a real number p{A) e [0, 1) such that 

\pi{A) - p{A)\ < di{A) for all i e IN, (5.1) 

where rf, is a sequence converging to zero. For simplicity we put d{A) = 
snp{di{A) \ i€lN}. 
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Theorem 5.1 Let {Xm},nfzz " stationary process. Let A G . Assume 
that i5.1\) holds. Then, there is c(A) G [0, 1), such that the following inequalities 
hold for all k <E M: 

k+l 

\IPa {Sa ^k)-{l- p{A)) p{A)>'\ < c(A)'= MA) < c(A)'=(fc + l)d{A) . 

We deduce immediately that the /3-moments of Sa can be approximated by 
IE(Y^) where y is a geometric random variable with parameter p{A). 

Corollary 5.1 LetY be a r.v. with geometric distribution with parameter p{A) . 
Let /3>0. Then 

OO 

fe=i 

Remark 5.2 The sum X]fc°=i k^'^^c{A)'' can be approximated using the Gamma 
function by r{f3 + 2)/(— lnc(A))''+^. When the supremum of the distances 
\pi{A) — p{A)\ is small, the approximations given by Theorem 15. Jl and Corol- 
laru \5.1\ are good. The smaller is c{A), the better they are. We compute these 
quantities for the examples of this section. 

Example 15.11 (continuation) It follows straight-forward from definitions that 
p^{A) = p{A) = JP(yl(^(^))) for aU i and for any A e C",7i G W. Thus 
c{A) = and d{A) = 0. 

Example 15.21 (continuation) We already compute that pi {A) = p{A) for all i 
and for any A e C", n e W. Thus c{A) = p{A) and 2{A) = 0. 

Example 15.31 (continuation) For the same n-string there considered, we have 

d,(A) = e-i|e^("+*+i)-*("+») - 1| < \5{n + i + -5{n + i)\ , 

and 

d{A) <svcp{\d{n + i + l) - 6{n + i)\,i e M} . 
So, for large enough n, d{A) is small. Finally, 

c{A) = max{e-i; sup e-i+'5(«+»+i)-'5("+*)} ^ (q, 1) . 

nGiV 

In the proof of Theorem 15.11 wc will use the following lemma. 

Lemma 5.1 Let {li)ii=i^ be a sequence of real numbers such that < li < 1, for 
all i e M. Let < I < 1 be such that \li - l\ < di for all i e M with di 0. 



Ea [s^a] - ^(Y^) 
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Then, there is a constant c G [0, 1), such that the following inequalities hold for 
all k e M: 



< c'^^^^di < k c^-^d . 



where d — supjdi, i £ IN]. 
Proof 



fc fe-l fe-l fe-2 k-2 



\{h w ki + n n + n - ^* 

i—l i—1 i—1 

k 



i—1 i—1 i—1 i—1 i—1 

k / k—i \ k 

^ En 



i=i y=i J i=i 
< k c^-^d , 

where c = max {/; supjgjv ^i}- '-' 

Proof of Theorem [57T] For fc = 0, wc just note that IP a {Sa = 0) = l-pi{A) 
and |1 - pi(yl) - (1 - p{A))\ < di{A). Suppose fc > 1. Therefore 

Pa {Sa = fc) 

= Pa if] T-^^'^^^A ■ T-'^^+^^^iA) j^c 
\i=o 

= IP I T^'^'^+^^'^^^M'^i Pi r"-'^('^M I WiP ( r~''^(^^A| Pi t"-'^('^M 



0=0 



3=0 



= {l-Pk+i{A))\{p.,{A) . 

i=l 

Third equahty follows by stationarity. Lemma lS.ll ends the proof of the theorem. 

□ 

Proof of Corollary 15.11 We use the inequality 

\IE (Xl^) - IE {Y^) \ <J2k'^ \^ ^ k) - IP (Y = k)\ , 

k>0 

which holds for any pair of positive r.v. X,Y. We apply the above inequality 
with X = Sa and Y geometrically distributed with parameter p{A). 

The exponential decay of the error term in Theorem 15.11 ends the proof of 
the corollary. □ 
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